Abstract

Let $S_{n,k}$ denote the random geometric graph obtained by placing
points in a square box of area $n$ according to a Poisson process of
intensity 1 and joining each point to its $k$ nearest neighbours. In [1]
Balister, Bollobas, Sarkar and Walters conjectured that for every
$0 < \varepsilon < 1$ and all $n$ sufficiently large there exists
$C = C(\varepsilon)$ such that if

$$\mathbb{P}(S_{n,k} \text{ connected}) > \varepsilon$$

then

$$\mathbb{P}(S_{n,k+C} \text{ connected}) > 1 - \varepsilon.$$

In this paper we prove this conjecture.

As a corollary we prove that there is a constant $C'$ such that whenever
$k(n)$ is a sequence of integers with

$$\mathbb{P}(S_{n,k(n)} \text{ connected}) \to 1 \text{ as } n \to \infty,$$

then for any integer sequence $s(n)$ with $s(n) = o(\log n)$,

$$\mathbb{P}(S_{n,k(n)+\lfloor C' s \log\log n \rfloor} \text{ is } s\text{-connected}) \to 1 \text{ as } n \to \infty.$$

This proves another conjecture of Balister, Bollobas, Sarkar and Walters [3].
Introduction

Let $S_n$ be the square $[0,\sqrt{n}] \times [0,\sqrt{n}] \subset \mathbb{R}^2$
and let $k$ be an integer. Place points in $S_n$ according to a Poisson
process of intensity 1 and put an undirected edge between each point and
its $k$ nearest neighbours. Let $S_{n,k}$ be the resulting random
geometric graph.
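The construction of $S_{n,k}$ can be made concrete with a small simulation. The following is only a minimal sketch, not part of the original argument: the function names (`poisson_count`, `sample_points`, `knn_graph`, `is_connected`) are illustrative choices, the neighbour search is brute force, and the Poisson number of points is drawn by counting rate-1 exponential arrivals.

```python
import math
import random


def poisson_count(mean, rng):
    """Number of rate-1 arrivals in [0, mean]; this count is Poisson(mean)."""
    count = 0
    t = rng.expovariate(1.0)
    while t <= mean:
        count += 1
        t += rng.expovariate(1.0)
    return count


def sample_points(n, rng):
    """Poisson process of intensity 1 on the square [0, sqrt(n)]^2."""
    side = math.sqrt(n)
    return [(rng.uniform(0, side), rng.uniform(0, side))
            for _ in range(poisson_count(n, rng))]


def knn_graph(points, k):
    """Adjacency sets of the undirected k-nearest-neighbour graph (brute force)."""
    adj = [set() for _ in points]
    for i, p in enumerate(points):
        others = [j for j in range(len(points)) if j != i]
        others.sort(key=lambda j: math.dist(p, points[j]))
        for j in others[:k]:
            adj[i].add(j)
            adj[j].add(i)
    return adj


def is_connected(adj):
    """Depth-first search from vertex 0; the empty graph counts as connected."""
    if not adj:
        return True
    seen, stack = {0}, [0]
    while stack:
        for v in adj[stack.pop()]:
            if v not in seen:
                seen.add(v)
                stack.append(v)
    return len(seen) == len(adj)


rng = random.Random(0)
pts = sample_points(200, rng)
print(len(pts), is_connected(knn_graph(pts, 1)), is_connected(knn_graph(pts, 8)))
```

Repeating the last three lines over many seeds gives a rough empirical estimate of the probability that $S_{n,k}$ is connected for small $n$.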

Several authors (see below) have considered the following question: for
which $k$ is $S_{n,k}$ connected? Of course, it is always possible for
$S_{n,k}$ to fail to be connected, no matter how large $k$ is; the best we
can hope for is that $S_{n,k}$ is connected 'asymptotically'. Formally,
given a function $k\colon \mathbb{N} \to \mathbb{N}$ and a property $Q$ of
geometric graphs, we say that $S_{n,k(n)}$ has property $Q$ with high
probability (abbreviated to whp) if

$$\lim_{n\to\infty} \mathbb{P}(S_{n,k(n)} \text{ has property } Q) = 1.$$

In other words, the probability that a random point set gives rise to a
graph with property $Q$ tends to one.

Elementary arguments indicate that there exist constants $c_l$ and $c_u$
such that for every $c < c_l$, $S_{n,\lfloor c\log n\rfloor}$ is whp not
connected while for every $c > c_u$, $S_{n,\lfloor c\log n\rfloor}$ is whp
connected. Using a result of Penrose [6], Xue and Kumar [10] showed that
$c_u \le 5.1774$. A bound of approximately $c_u \le 3.8597$ can also be
read out of earlier work by Gonzales-Barrios and Quiroz [5].

These results were significantly improved by Balister, Bollobas, Sarkar
and Walters in [1, 2], where they established the existence of a critical
constant $c_*$ with $0.3043 < c_* < 1/\log 7 \approx 0.5139$ such that for
any $c < c_*$, $S_{n,\lfloor c\log n\rfloor}$ is whp not connected and for
any $c > c_*$, $S_{n,\lfloor c\log n\rfloor}$ is whp connected. They also
made the following conjecture about the sharpness of the transition.

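Conjecture (Conjecture 3 of [1]). For any $0 < \varepsilon < 1$, there
exists an integer constant $C(\varepsilon)$ such that for all $n$
sufficiently large, if

$$\mathbb{P}(S_{n,k} \text{ is connected}) > \varepsilon$$

then

$$\mathbb{P}(S_{n,k+C(\varepsilon)} \text{ is connected}) > 1 - \varepsilon.$$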

The main result of this paper is the following theorem, which proves the
conjecture for an explicit function $C(\varepsilon)$.

Theorem 1. There exist absolute constants $C > 0$ and $\gamma > 0$ such
that for every $0 < \varepsilon < 1$ and all $n > \varepsilon^{-\gamma}$, if

$$\mathbb{P}(S_{n,k} \text{ is connected}) > \varepsilon$$

then

$$\mathbb{P}(S_{n,k+\lfloor C\log(1/\varepsilon)\rfloor} \text{ is connected}) > 1 - \varepsilon.$$

In [3] Balister, Bollobas, Sarkar and Walters proved a weaker variant
of their conjecture, which they used to show that if $k = k(n)$ is such
that $S_{n,k(n)}$ is connected whp then for any $s = o(\log n)$ the graphs
$S_{n,k'(n)}$, where $k'(n) = k(n) + \lfloor 6\sqrt{(s-1)\log n}\rfloor$,
are whp $s$-connected in a technical sense of 'on average'. As an
immediate corollary to Theorem 1 we may remove the somewhat complicated
hypothesis that they needed in the statement of their result, Theorem 10
of [3] (admittedly with a weaker constant). Moreover, in the final section
we strengthen this substantially, proving the following theorem.

Theorem 2. Whenever $k(n)$ is an integer sequence such that $S_{n,k(n)}$
is whp connected and $s(n)$ is an integer sequence with
$s(n) = o(\log n)$, then $S_{n,k(n)+\lfloor 2Cs\log\log n\rfloor}$ is whp
$s$-connected.

This proves the main conjecture in [3].

Before we describe the structure of our paper, we briefly contrast the
$k$ nearest neighbours model with another classical random geometric graph
model introduced by Gilbert [4]. As before, let $S_n$ be the square
$[0,\sqrt{n}] \times [0,\sqrt{n}] \subset \mathbb{R}^2$. Let $r$ be a real
number. Again, place points in $S_n$ according to a Poisson process of
intensity 1 but this time put an undirected edge between any pair of
points which lie at a distance of at most $r$ from one another. We denote
by $G_{n,r}$ the resulting random geometric graph model. $G_{n,r}$ is
often known as the Gilbert disc model. Penrose [6, 7, 8] proved very
precise results on the connectivity of $G_{n,r}$. In particular he showed
that isolated vertices are the main obstacle to connectivity in the sense
that whp

$$\inf\{r > 0 : G_{n,r} \text{ is connected}\} = \inf\{r > 0 : G_{n,r} \text{ has no isolated vertices}\}.$$
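For a fixed finite point set both sides of this identity are easy to compute, which gives a concrete way to see what the statement compares. The sketch below is illustrative only (the function names are our own): the connectivity radius is the longest edge of a minimum spanning tree, found here with a quadratic-time version of Prim's algorithm, and the isolation radius is the largest nearest-neighbour distance. The second function assumes at least two points.

```python
import math


def longest_mst_edge(points):
    """Prim's algorithm; the longest MST edge is the least r making G_{n,r} connected."""
    m = len(points)
    if m < 2:
        return 0.0
    best = [math.inf] * m      # cheapest link from each vertex to the current tree
    in_tree = [False] * m
    best[0] = 0.0
    longest = 0.0
    for _ in range(m):
        u = min((i for i in range(m) if not in_tree[i]), key=lambda i: best[i])
        in_tree[u] = True
        longest = max(longest, best[u])
        for v in range(m):
            if not in_tree[v]:
                best[v] = min(best[v], math.dist(points[u], points[v]))
    return longest


def largest_nearest_neighbour_distance(points):
    """Least r for which no vertex of G_{n,r} is isolated (needs >= 2 points)."""
    return max(min(math.dist(p, q) for j, q in enumerate(points) if j != i)
               for i, p in enumerate(points))


clustered = [(0, 0), (1, 0), (5, 0), (6, 0)]
print(longest_mst_edge(clustered), largest_nearest_neighbour_distance(clustered))
```

On a deterministic, clustered configuration such as the one above the two radii differ; the theorem of Penrose says that for a Poisson point set they coincide whp.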

The situation is quite different for the $k$ nearest neighbours model,
which has no isolated vertices nor any immediately apparent analogous
family of geometric obstructions to connectivity; indeed the value of the
critical constant $c_*$ is not known (although it may well be the lower
bound of $0.3043\ldots$ proved in [1]).

One motivation for the study of $S_{n,k}$ (and the Gilbert disc model)
comes from the theory of ad-hoc wireless networks. We imagine that we have
various radio transmitters (nodes) that wish to communicate using multiple
hops. The transmitters could have fixed range, which naturally corresponds
to the Gilbert disc model, or they could adjust their power so that each
node has some fixed number of neighbours, which is exactly the $k$-nearest
neighbour model. In this context Theorem 2 is a result about the fault
tolerance of such a network: it says that we can have a fault tolerant
network for very little additional cost over the minimum needed for
communication.

Outline of Paper 

In the first section, we adapt techniques first introduced in [2] to
relate the global property of connectivity to certain families of local
events: these will be events determined by the Poisson process inside a
square of area of order $\log n$.

In the second section we prove a geometric lemma which is crucial to
our argument, establishing that 'small' connected components in $S_{n,k}$
have a region of 'high point density'.

In the third section we show that removing points from such a dense
region results in a much more likely configuration which still gives rise
to a small connected component in the $k'$-nearest neighbour graph for some
$k'$ a little smaller than $k$. In other words the graph $S_{n,k'}$ is much
more likely to be disconnected than $S_{n,k}$, which is exactly Theorem 1.

In the final section we prove Theorem 2.



1 Local obstacles to connectivity 



Following [2], we shall relate the global connectivity of $S_{n,k}$ to
certain families of local events. Let $M$ be an integer constant which we
shall specify later on. Let $U_n$ be the square

$$U_n = \left[-\frac{M\sqrt{\log n}}{2}, \frac{M\sqrt{\log n}}{2}\right] \times \left[-\frac{M\sqrt{\log n}}{2}, \frac{M\sqrt{\log n}}{2}\right] \subset \mathbb{R}^2.$$

We shall refer to the subsquare $\frac{1}{2}U_n$ as the central subsquare
of $U_n$. Place points in $U_n$ according to a Poisson process of
intensity 1, and put an undirected edge between any point and the $k$
points nearest to it to obtain the random geometric graph $U_{n,k}$.

We define $A_k$ to be the event that $U_{n,k}$ has a connected component
wholly contained inside the central subsquare $\frac{1}{2}U_n$. First,
note that our $A_k$ event is slightly different from the family of events
defined in [2]: there the size of the box corresponding to $U_n$ varied
with $k$ rather than $\log n$. One of the advantages of our definition of
$U_n$ is that the $A_k$-events are nested: if $k < k'$, then
$A_{k'} \subseteq A_k$. We shall cover most of $S_n$ with copies of $U_n$
and show (approximately) that $S_{n,k}$ is disconnected if and only if the
event $A_k$ occurs in one of these copies.

For this argument to work we need to ensure that whp $S_{n,k}$ contains
no 'long' edges (relative to $M\sqrt{\log n}$) and only one connected
component of 'large' diameter. The following result is exactly what we
want.

Lemma 3 (Lemma 1 of [2]). For any fixed $\alpha_1, \alpha_2$ with
$0 < \alpha_1 < \alpha_2$ and any $\beta > 0$, there exists
$c = c(\alpha_1, \alpha_2, \beta) > 0$, depending only on $\alpha_1$,
$\alpha_2$ and $\beta$, such that for any $k$ with
$\alpha_1 \log n \le k \le \alpha_2 \log n$, the probability that $S_{n,k}$
contains two components each of diameter at least $c\sqrt{\log n}$ or any
edge of length at least $c\sqrt{\log n}$ is $O(n^{-\beta})$.

Remark: In this paper we use the $O$ notation in a slightly non-standard
way. Most of our results depend on $n$ and $k$ where $k = k(n)$ is a
function of $n$. When we say $f(n,k) = O(n)$ we mean 'uniformly in $k$':
that is, there is a constant $B$ such that $f(n,k) \le Bn$ for all $n$ and
$k$ (satisfying our other constraints).

Let $M = \max(\lceil 16c(0.3, 0.6, 2)\rceil, 30)$. In our argument we shall
also need the following lemma, which is an easy modification of Corollary 6
of [2].

Lemma 4. For any $n$ and any integer $k$ with
$0.3\log n \le k \le 0.6\log n$, the probability that $U_{n,k}$ contains an
edge of length at least $\frac{M\sqrt{\log n}}{8}$ is $O(n^{-6})$.

Proof. This is very similar to the proof of Corollary 6 of [2], but we
have to make allowances for the slight difference in our definition of the
event $A_k$.

Let $k \le 0.6\log n$. Suppose some vertex $x \in U_n$ has its $k$-th
nearest neighbour lying at a distance of at least
$\frac{M\sqrt{\log n}}{8}$. Then there must be fewer than
$k \le 0.6\log n$ points within a quarter-disc about $x$ of area
$\frac{\pi M^2\log n}{256}$. (We need to consider quarter-discs since $x$
may be close to a corner of $U_n$.) Since we picked $M \ge 30$, we have
$\frac{\pi M^2\log n}{256} > 10\log n$. Let
$X \sim \mathrm{Poisson}(10\log n)$. Then,

$$\begin{aligned}
\mathbb{P}(X \le 0.6\log n) &= \sum_{s \le 0.6\log n} \frac{(10\log n)^s}{s!}\, e^{-10\log n} \\
&\le (0.6\log n)\left(\frac{10\log n}{0.6\log n/e}\right)^{0.6\log n} e^{-10\log n} \\
&\le 0.6(\log n)\, e^{(0.6\log(50e/3) - 10)\log n} \\
&< e^{-7\log n} \quad \text{for } n \text{ sufficiently large.}
\end{aligned}$$

Thus the probability that any vertex $x \in U_n$ has its $k$-th nearest
neighbour lying at distance at least $\frac{M\sqrt{\log n}}{8}$ away is at
most

$$\mathbb{E}\{\text{number of vertices in } U_n\} \times \mathbb{P}(X \le 0.6\log n) \le M^2(\log n)\, e^{-7\log n} = O(n^{-6}),$$

as required. □
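The Poisson tail estimate used in this proof can be sanity-checked numerically. The sketch below is an illustration rather than part of the proof: it evaluates $\log \mathbb{P}(X \le 0.6\log n)$ exactly for $X \sim \mathrm{Poisson}(10\log n)$, working in log space to avoid underflow, and prints the resulting exponent of $n$. The helper name `log_poisson_cdf` and the sample values of $n$ are our own choices.

```python
import math


def log_poisson_cdf(mean, x):
    """log P(X <= x) for X ~ Poisson(mean), via a log-sum-exp over the terms."""
    logs = [s * math.log(mean) - mean - math.lgamma(s + 1)
            for s in range(math.floor(x) + 1)]
    top = max(logs)
    return top + math.log(sum(math.exp(v - top) for v in logs))


for n in (10**3, 10**6, 10**9):
    log_n = math.log(n)
    tail = log_poisson_cdf(10 * log_n, 0.6 * log_n)
    # tail / log_n is the exponent a for which the tail probability equals n^a
    print(n, tail / log_n)
```

For these values of $n$ the printed exponent lies below $-7$, in line with an upper bound of the form $e^{-7\log n}$ for large $n$.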
We also need to define what we meant by 'most' of $S_n$. Let

The nice feature of $T_n$ is that it is not very close to any of the
boundary of $S_n$. The following lemma is a minor restatement of Theorem 1
of [9].

Lemma 5. There is a positive constant $0 < c_1 < 2$ such that if
$k > 0.3\log n$ then the probability that $S_{n,k}$ contains any component
of diameter $O(\sqrt{\log n})$ not wholly contained in $T_n$ is
$O(n^{-c_1})$.

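We now define two covers of $T_n$ by copies of $U_n$. The independent
cover $\mathcal{C}_1$ of $T_n$ is obtained by covering $T_n$ with copies
of $U_n$ with disjoint interiors. The dominating cover $\mathcal{C}_2$ of
$T_n$ is obtained from $\mathcal{C}_1$ by replacing each square
$V \in \mathcal{C}_1$ by the sixteen translates
$V + \left(\frac{iM\sqrt{\log n}}{4}, \frac{jM\sqrt{\log n}}{4}\right)$,
$i, j \in \{0, 1, 2, 3\}$. By construction, we have
$\mathcal{C}_1 \subseteq \mathcal{C}_2$ and the copies of
$\frac{1}{2}U_n$ corresponding to elements of $\mathcal{C}_2$ cover the
whole of $T_n$. Also $|\mathcal{C}_2| \le 16\frac{n}{M^2\log n}$.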
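The covering property of the dominating cover can be illustrated as follows. This is a sketch under stated assumptions, not the paper's construction verbatim: we assume the squares of the independent cover have their lower-left corners on the lattice with spacing equal to the side length of $U_n$, so that the sixteen translates put corners on the lattice with a quarter of that spacing, and we ignore the boundary of $T_n$. The names `covering_copy` and `in_central_subsquare` are hypothetical.

```python
import math


def covering_copy(x, y, side):
    """Lower-left corner of a copy V of U_n, with corner on the lattice
    (side/4) * Z^2, whose central subsquare (1/2)V contains the point (x, y)."""
    q = side / 4.0
    return ((math.floor(x / q) - 1) * q, (math.floor(y / q) - 1) * q)


def in_central_subsquare(x, y, corner, side, tol=1e-9):
    """True if (x, y) lies in the middle half of the square with this corner."""
    a, b = corner
    return (a + side / 4 - tol <= x <= a + 3 * side / 4 + tol and
            b + side / 4 - tol <= y <= b + 3 * side / 4 + tol)


side = 7.3  # stands in for M * sqrt(log n)
corner = covering_copy(2.0, -3.5, side)
print(corner, in_central_subsquare(2.0, -3.5, corner, side))
```

The chosen copy in fact places the point in the quarter of the square nearest its lower-left central corner, which is slightly more than membership of the central subsquare.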

We shall write '$A_k$ occurs in $\mathcal{C}_i$' as a convenient
shorthand for 'there is a copy $V$ of $U_n$ in $\mathcal{C}_i$ for which
the event corresponding to $A_k$ occurs'. We shall also write $V_k$ for the
$k$-nearest neighbour graph on $V$, and $\frac{1}{2}V$ for the central
subsquare of $V$.

Lemmas 3 and 4 allow us to relate, up to some small error, the global
connectivity to the local events $A_k$. Before we make this relationship
precise we need a technical lemma.

Lemma 6. Suppose $S_{n,k}$ contains no edge of length greater than
$\frac{M\sqrt{\log n}}{16}$ and that $V \in \mathcal{C}_2$ is a copy of
$U_n$ such that $V_k$ contains no edge of length greater than
$\frac{M\sqrt{\log n}}{8}$. Then $S_{n,k}$ has a connected component
contained inside $\frac{1}{2}V$ whenever the event corresponding to $A_k$
occurs in $V$.

Proof. Let $T_V$ denote the subgraph of $V_k$ consisting of all edges with
at least one end in $\frac{1}{2}V$, and let $T_S$ be the subgraph of
$S_{n,k}$ consisting of all edges with at least one end in
$\frac{1}{2}V$. We aim to show that $T_V = T_S$. Obviously this will imply
the lemma.

Trivially, $S_{n,k}[V]$ is a subset of $V_k$. What extra edges can there
be in $V_k$? We are assuming that $S_{n,k}$ contains no edges of length
greater than $\frac{M\sqrt{\log n}}{16}$. Thus only the vertices within
distance $\frac{M\sqrt{\log n}}{16}$ of the boundary of $V$ may be joined
in $S_{n,k}$ to points in $S_n \setminus V$. So every edge in
$V_k \setminus S_{n,k}[V]$ (i.e., all extra edges) must meet one of these
vertices.

Now $V_k$ contains no edges of length greater than
$\frac{M\sqrt{\log n}}{8}$, so that all the vertices meeting an edge of
$V_k \setminus S_{n,k}[V]$ must lie at a distance at most

$$\frac{M\sqrt{\log n}}{8} + \frac{M\sqrt{\log n}}{16} < \frac{M\sqrt{\log n}}{4}$$

from the boundary of $V$. Since the vertices inside the central subsquare
$\frac{1}{2}V$ all lie at distance at least $\frac{M\sqrt{\log n}}{4}$ from
the boundary of $V$, they do not meet any extra edges, and we have
$T_V = T_S$ as claimed. □

Theorem 7. For all $n \in \mathbb{N}$ and all integers $k$ with
$0.3\log n < k < 0.6\log n$, and $c_1$ as given by Lemma 5,

$$\mathbb{P}(S_{n,k} \text{ not connected}) = \mathbb{P}(A_k \text{ occurs in } \mathcal{C}_2) + O(n^{-c_1}).$$

Proof. Suppose that $A_k$ occurs in $\mathcal{C}_2$. Then there is a copy
$V$ of $U_n$ in $\mathcal{C}_2$ for which $A_k$ occurs; in other words,
$V_k$ has a connected component $X$ wholly contained inside the central
subsquare $\frac{1}{2}V$. By Lemma 3 and our choice of $M$, the
probability that $S_{n,k}$ contains an edge of length at least
$\frac{M\sqrt{\log n}}{16}$ is $O(n^{-2})$. Let us assume this does not
happen. Then there are no edges between $\frac{1}{2}V$ and
$S_n \setminus V$ in $S_{n,k}$. It follows that $X$ is a connected
component in $S_{n,k}$ as well as in $V_k$, so that $S_{n,k}$ is
disconnected. Thus

$$\mathbb{P}(S_{n,k} \text{ not connected}) \ge \mathbb{P}(A_k \text{ occurs in } \mathcal{C}_2) + O(n^{-2}).$$

Conversely, suppose $S_{n,k}$ is not connected. It must contain at least
two connected components. By Lemma 3 and our choice of $M$, the
probability that $S_{n,k}$ contains any edge of length at least
$\frac{M\sqrt{\log n}}{16}$ or two components of diameter at least
$\frac{M\sqrt{\log n}}{16}$ is at most $O(n^{-2})$. By Lemma 5 the
probability that there is a small component not contained entirely within
$T_n$ is $O(n^{-c_1})$. Also by Lemma 4 the probability that $U_{n,k}$ has
any edge longer than $\frac{M\sqrt{\log n}}{8}$ is $O(n^{-6})$. The
probability that $V_k$ has an edge longer than
$\frac{M\sqrt{\log n}}{8}$ for some copy $V$ of $U_n$ in $\mathcal{C}_2$
is therefore at most $|\mathcal{C}_2|\,O(n^{-6}) = O(n^{-5})$. Thus the
probability of any of the above occurring in $S_{n,k}$ is at most
$O(n^{-c_1})$.

From now on let us assume none of the above occur. Then at least one
of the connected components of $S_{n,k}$ is contained in $T_n$ and has
diameter less than $\frac{M\sqrt{\log n}}{16}$. Let $X$ be such a
component and $x$ be a vertex of $X$. By our definition of
$\mathcal{C}_2$ there is a copy $V$ of $U_n$ such that
$x \in \frac{1}{4}V$. For any point $x' \notin \frac{1}{2}V$, we have
$d(x,x') \ge \frac{M\sqrt{\log n}}{8}$. By our assumption on the diameter
of $X$, we have that $x' \notin X$ and hence $X \subseteq \frac{1}{2}V$.
So $X$ is contained entirely inside the central subsquare
$\frac{1}{2}V$. Now $V_k$ and $S_{n,k}$ satisfy the hypotheses of Lemma 6,
hence the event corresponding to $A_k$ occurs in $V$, and

$$\mathbb{P}(S_{n,k} \text{ not connected}) \le \mathbb{P}(A_k \text{ occurs in } \mathcal{C}_2) + O(n^{-c_1}).$$

The theorem follows. □
Roughly speaking $\mathbb{P}(A_k \text{ occurs in } \mathcal{C}_2)$ is of
order $\frac{n}{M^2\log n}\mathbb{P}(A_k)$ so, from a heuristic
perspective, Theorem 7 tells us that as we increase $k$ the transition of
$S_{n,k}$ from whp not connected to whp connected happens at the same time
as the transition from $\mathbb{P}(A_k) \gg \frac{\log n}{n}$ to
$\mathbb{P}(A_k) \ll \frac{\log n}{n}$. The following is a precise
statement of this relationship.

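Corollary 8. There exists a constant $c_2 > 0$ such that for all
$\varepsilon$ with $0 < \varepsilon < \frac{1}{2}$, all integers
$n > \varepsilon^{-c_2}$ and all integers $k$ with
$0.3\log n < k < 0.6\log n$, if

$$\mathbb{P}(S_{n,k} \text{ connected}) > \varepsilon$$

holds then

$$\mathbb{P}(A_k) < \frac{eM^2\log n}{n}\log\left(\frac{1}{\varepsilon}\right).$$

Conversely, if

$$\mathbb{P}(A_k) < \frac{\varepsilon M^2\log n}{e^4 n}$$

then

$$\mathbb{P}(S_{n,k} \text{ connected}) > 1 - \varepsilon.$$

Remark: There is nothing special about the constants $e$ and $e^4$: we
picked these values for later convenience, but all we needed was $e > 2$
and $e^4 > 16$.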

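Proof. Suppose $\mathbb{P}(S_{n,k} \text{ is connected}) > \varepsilon$.
The copies of $U_n$ contained in $\mathcal{C}_1$ have disjoint interiors,
hence the event corresponding to $A_k$ occurs in each of them
independently. Therefore

$$\mathbb{P}(A_k \text{ occurs in } \mathcal{C}_1) = 1 - (1 - \mathbb{P}(A_k))^{|\mathcal{C}_1|}.$$

Now,

$$\begin{aligned}
\mathbb{P}(A_k \text{ occurs in } \mathcal{C}_1) &\le \mathbb{P}(A_k \text{ occurs in } \mathcal{C}_2) && \text{since } \mathcal{C}_1 \subseteq \mathcal{C}_2 \\
&= \mathbb{P}(S_{n,k} \text{ not connected}) + O(n^{-c_1}) && \text{by Theorem 7} \\
&< 1 - \varepsilon + O(n^{-c_1}).
\end{aligned}$$

Hence,

$$(1 - \mathbb{P}(A_k))^{|\mathcal{C}_1|} > \varepsilon + O(n^{-c_1}).$$

Hence, provided we chose $c_2$ large enough, we see that, for all
$n > \varepsilon^{-c_2}$, the right hand side is at least
$\frac{\varepsilon}{2}$. Taking logarithms on both sides and using the
inequality $\log(1-x) \le -x$ for $0 \le x < 1$ yields

$$-|\mathcal{C}_1|\,\mathbb{P}(A_k) \ge \log(\varepsilon/2)$$

so

$$\mathbb{P}(A_k) \le \frac{1}{|\mathcal{C}_1|}\left(\log\frac{2}{\varepsilon}\right) = \frac{1}{|\mathcal{C}_1|}\left(\log\frac{1}{\varepsilon} + \log 2\right).$$

Now $\mathcal{C}_1$ contains $\frac{n}{M^2\log n}(1 + o(1))$ copies of
$U_n$, $0 < \varepsilon < \frac{1}{2}$ and $e > 2$. Hence, provided that
we choose our constant $c_2$ sufficiently large, for all
$n > \varepsilon^{-c_2}$ we have

$$\mathbb{P}(A_k) < \frac{eM^2\log n}{n}\log\frac{1}{\varepsilon}.$$

For the converse suppose that
$\mathbb{P}(A_k) < \frac{\varepsilon M^2\log n}{e^4 n}$. By Theorem 7 we
have

$$\begin{aligned}
\mathbb{P}(S_{n,k} \text{ not connected}) &= \mathbb{P}(A_k \text{ occurs in } \mathcal{C}_2) + O(n^{-c_1}) \\
&\le |\mathcal{C}_2|\,\mathbb{P}(A_k) + O(n^{-c_1}) \\
&< \frac{16\varepsilon}{e^4} + O(n^{-c_1}) && \text{since } |\mathcal{C}_2| \le 16\frac{n}{M^2\log n}.
\end{aligned}$$

Since $0 < \varepsilon < \frac{1}{2}$ and $\frac{16}{e^4} < 1$, we have
(again providing we chose $c_2$ sufficiently large) for all
$n > \varepsilon^{-c_2}$,

$$\mathbb{P}(S_{n,k} \text{ not connected}) < \varepsilon.$$

□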
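The step from independence across the copies in the independent cover to an upper bound on the local probability is elementary, and can be illustrated numerically. The sketch below is our own illustration with a hypothetical helper name: given a number of independent copies and a threshold $\varepsilon$, it compares the exact largest $p$ with $(1-p)^{N} \ge \varepsilon/2$ against the cruder bound $\log(2/\varepsilon)/N$ that follows from $\log(1-x) \le -x$.

```python
import math


def max_local_probability(num_copies, eps):
    """Largest p compatible with (1 - p) ** num_copies >= eps / 2, together with
    the cruder bound log(2 / eps) / num_copies obtained from log(1 - x) <= -x."""
    exact = 1 - (eps / 2) ** (1 / num_copies)
    crude = math.log(2 / eps) / num_copies
    return exact, crude


for num_copies in (10, 1000, 10**6):
    print(num_copies, max_local_probability(num_copies, 0.1))
```

For $\varepsilon < 1/2$ the crude bound is in turn at most $2\log(1/\varepsilon)/N$, which is where the slack between the constants 2 and $e$ is used.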



2 Small components have high point density 

Having made precise the relationship between $\mathbb{P}(A_k)$ and
$\mathbb{P}(S_{n,k} \text{ connected})$, we turn our attention to
$\mathbb{P}(A_k)$. Our aim in this section is to show that provided
$k > 0.3\log n$, small connected components in $U_{n,k}$ witnessing $A_k$
must have a region with 'high point density'.

Let $N$ be an integer constant whose value we shall specify later. We
consider a perfect tiling of $U_n$ by square tiles of area
$\frac{\log n}{N^2}$. (Such a perfect tiling exists as $U_n$ has area
$M^2\log n$ and $M$, $N$ are integers.) The expected number of points of
the Poisson point process on $U_n$ in each tile is $\frac{\log n}{N^2}$.
Fix $0 < \eta < \frac{1}{2}$. Given a tile $Q$, we say that the event
$A_{k,Q}$ occurs if $A_k$ occurs and the tile $Q$ receives more than
$(1+\eta)\frac{\log n}{N^2}$ points. Similarly, we say that the event
$A'_{k,Q}$ occurs if $A_k$ occurs and the tile $Q$ receives more than
$(1+\frac{\eta}{2})\frac{\log n}{N^2}$ points.

Lemma 9. Suppose $k \in [0.3\log n, 0.6\log n]$. Then

$$\mathbb{P}\Bigl(A_k \setminus \bigcup_Q A_{k,Q}\Bigr) = O(n^{-1.1}).$$
The main idea of the proof of this geometric lemma is the following:
suppose $X$ is a connected component of $U_{n,k}$ wholly contained inside
$\frac{1}{2}U_n$, and suppose $x$ is a vertex of $X$ which lies 'on the
boundary' of $X$. Write $r$ for the distance between $x$ and its $k$-th
nearest neighbour.

If $U_n$ contains no tile with high density (i.e. no tile receiving more
than $(1+\eta)$ times the expected number of points), then the
intersection of the ball of radius $r$ centred at $x$ with the 'convex
hull' of $X$ must have large area (about $\frac{k}{1+\eta} - o(k)$). In
particular looking outwards from $X$ at $x$ there must be quite a few
empty tiles. Doing the above in several different directions one gets that
$X$ is surrounded by a wide 'sea' of empty tiles of area at least
$1.1\log n$. Since the number of tiles $M^2N^2$ is a constant, the
probability that such a collection of empty tiles exists is
$O(n^{-1.1})$, yielding the desired result.

Before we start, we need the following technical result.

Lemma 10. Let $\gamma\colon [0,1] \to U_n$ be a closed continuously
differentiable curve in $U_n$. Let $l(\Gamma)$ be the length of the curve
$\Gamma = \gamma([0,1])$, and let $D$ be the number of tiles it meets.
Then

$$D \le \frac{9\,l(\Gamma)}{\sqrt{\log n}/N}.$$

Proof. We define a graph $G$ on the set of tiles of $U_n$ by setting an
edge between tiles $Q$ and $Q'$ if they meet in at least one point. ($G$
is just the usual square integer lattice on $\{1, 2, \ldots, MN\}^2$ with
diagonal edges added.) Every tile has at most 8 neighbours in this graph.
Let $S$ be the set of tiles met by $\Gamma$. Greedily pick a maximal
subset $S' \subseteq S$ which is independent in $G$: pick the tile $Q_1$
with $\gamma(0) \in Q_1$, then pick the first nonadjacent tile $Q_2$ which
$\gamma(t)$ next meets and so on. We have $D = |S| \le 9|S'|$. Now
$\Gamma$ is continuous and cycles through the tiles of $S'$ before coming
back to $Q_1$. Since the minimum distance between points lying in
nonadjacent tiles is at least one tile length (i.e.,
$\frac{\sqrt{\log n}}{N}$), it follows that the length of $\Gamma$
satisfies

$$l(\Gamma) \ge |S'|\,\frac{\sqrt{\log n}}{N}.$$

Substituting $D \le 9|S'|$ and rearranging terms, we get the desired
inequality

$$D \le \frac{9\,l(\Gamma)}{\sqrt{\log n}/N}.$$

□
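As a quick illustration of the tile-counting bound, one can count the tiles of a square grid met by a circle and compare with nine times the circumference divided by the tile side. The sketch below is illustrative only: the tile count is approximated by sampling the curve densely (so it can only undercount), and the function name, grid offset and sample size are arbitrary choices of ours.

```python
import math


def tiles_met_by_circle(cx, cy, r, tile, samples=100_000):
    """Approximate set of tiles (indexed by integer pairs) met by a circle,
    found by sampling points densely along the curve."""
    met = set()
    for s in range(samples):
        t = 2 * math.pi * s / samples
        met.add((math.floor((cx + r * math.cos(t)) / tile),
                 math.floor((cy + r * math.sin(t)) / tile)))
    return met


tile = 0.1      # stands in for the tile side sqrt(log n) / N
radius = 1.0
D = len(tiles_met_by_circle(0.03, 0.07, radius, tile))
bound = 9 * (2 * math.pi * radius) / tile
print(D, bound)
```

The bound is far from tight for a circle; the lemma only needs a linear dependence on the length of the curve.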



Proof of Lemma 9. Let $k$ be an integer with
$0.3\log n \le k \le 0.6\log n$. By Lemma 4 the probability of $U_{n,k}$
containing any edge of length at least $\frac{M\sqrt{\log n}}{8}$ is
$O(n^{-6})$. Since we are trying to show
$A_k \setminus \bigcup_Q A_{k,Q}$ has probability at most $O(n^{-1.1})$,
we may assume in what follows that all edges in $U_{n,k}$ have length
strictly less than $\frac{M\sqrt{\log n}}{8}$.

Figure 1: The hexagonal hull $H$ and regions $I_2$, $J_2$, $K_2$.

Suppose $\mathcal{P}$ is a pointset for which $A_k$ occurs but $A_{k,Q}$
does not occur for any tile $Q$. Write $U_{n,k}(\mathcal{P})$ for the $k$
nearest neighbours graph on $U_n$ associated with the pointset
$\mathcal{P}$. Let $X$ be the set of vertices of a connected component of
$U_{n,k}(\mathcal{P})$ wholly contained in $\frac{1}{2}U_n$. Using an idea
of Balister, Bollobas, Sarkar and Walters [1], we shall consider the
hexagonal hull of $X$, $H(X)$, which we now define.

We consider the six tangents to the convex hull of $X$ making angles of
$0$, $\frac{\pi}{3}$ and $\frac{2\pi}{3}$ with the $x$-axis (two for each
angle). Together, these define a hexagon $H(X)$ containing $X$ whose edges
are segments of the tangents (some of which may have zero length). We
shall call $H(X)$ the hexagonal hull of $X$, and label its edges
$E_1, E_2, \ldots, E_6$ in cyclic order so that the top and bottom edges
parallel to the $x$-axis are $E_2$ and $E_5$ respectively.

Consider $E_1$. There exists $x_1 \in E_1 \cap X$. Let $r_1$ be the
distance between $x_1$ and its $k$-th nearest neighbour. Let $I_1$ be the
intersection of the ball of radius $r_1$ centred at $x_1$ with the hexagon
$H(X)$. Let $I'_1$ be the reflection of $I_1$ with respect to $E_1$. Since
$I'_1 \subseteq U_n \setminus H(X)$ and since every point of $I'_1$ lies
at distance at most $r_1$ from $x_1$, it follows that $I'_1$ contains no
point of $\mathcal{P}$. We shall show that $I'_1$ covers many tiles.

Let $J_1$ be the union of all of the tiles wholly contained inside $I_1$,
and let $J'_1$ be the union of all of the tiles wholly contained inside
$I'_1$. Let $K_1$ be the union of all of the tiles meeting $I_1$ and let
$K'_1$ be the union of all of the tiles meeting $I'_1$. Since no tile in
$U_n$ contains more than $(1+\eta)\frac{\log n}{N^2}$ points of
$\mathcal{P}$, it follows that $K_1$ is the union of at least
$\frac{k}{(1+\eta)\log n/N^2}$ tiles.

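A tile is contained in $K_1 \setminus J_1$ only if it meets the boundary
of $I_1$. Now, since $I_1$ is a convex subset of a disc of radius $r_1$,
the boundary of $I_1$ has length less than $2\pi r_1$, so by Lemma 10
$K_1 \setminus J_1$ is the union of at most
$\frac{18\pi r_1}{\sqrt{\log n}/N}$ tiles. By the same argument,
$K'_1 \setminus J'_1$ is the union of at most
$\frac{18\pi r_1}{\sqrt{\log n}/N}$ tiles.

Denote by $|I_1|$ the area of $I_1$, and similarly
$|I'_1|, |J_1|, \ldots, |K'_1|$. We have

$$|J'_1| \ge |I'_1| - |K'_1 \setminus J'_1| \ge |K_1| - |K_1 \setminus J_1| - |K'_1 \setminus J'_1|.$$

Now each tile has area $\frac{\log n}{N^2}$. We therefore have

$$\begin{aligned}
|J'_1| &\ge \frac{\log n}{N^2}\left(\frac{k}{(1+\eta)\log n/N^2} - \frac{36\pi r_1}{\sqrt{\log n}/N}\right) \\
&= \frac{k}{1+\eta} - \frac{36\pi r_1\sqrt{\log n}}{N} \\
&\ge \frac{k}{1+\eta} - \frac{9M\pi\log n}{2N} \quad \text{since } r_1 < \frac{M\sqrt{\log n}}{8}.
\end{aligned}$$

We turn at last to the choice of $N$: let $N = 10\lceil 27M\pi\rceil$. For
$k \ge 0.3\log n$ and $\eta < \frac{1}{2}$, the above becomes:

$$|J'_1| \ge \frac{11}{60}\log n.$$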

For $i = 2, 3, \ldots, 6$ we may define $I_i$, $I'_i$, etc. as above. It
is easy to see that the $J'_i$ are disjoint: each $J'_i$ lies between the
bisectors of two adjacent angles of the convex hexagon $H(X)$. Repeating
the argument above to bound below $|J'_2|, \ldots, |J'_6|$, we get

$$\Bigl|\bigcup_{i=1}^{6} J'_i\Bigr| = \sum_{i=1}^{6} |J'_i| \ge \frac{11}{10}\log n.$$

Thus there are at least
$\frac{11}{10}\log n / (\log n/N^2) = 110(\lceil 27M\pi\rceil)^2$ tiles
which receive no points. There are at most
$\binom{M^2N^2}{110(\lceil 27M\pi\rceil)^2}$ ways of choosing this many
tiles. Since $M$ and $N$ are constants this is just a (large) constant.
The probability that there exist $110(\lceil 27M\pi\rceil)^2$ empty tiles
(i.e., empty tiles with total area $\frac{11}{10}\log n$) in $U_n$ is
therefore

$$O\bigl(\exp(-\tfrac{11}{10}\log n)\bigr) = O(n^{-1.1}).$$

Thus

$$\mathbb{P}\Bigl(A_k \setminus \bigcup_Q A_{k,Q}\Bigr) = O(n^{-1.1}),$$

as claimed.

□
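The arithmetic behind the choice $N = 10\lceil 27M\pi\rceil$ can be checked mechanically. The following sketch is an illustration under the worst-case values $\eta = 1/2$ and $k = 0.3\log n$; the function name and return convention are our own. It returns $N$, the coefficient of $\log n$ in the lower bound on $|J'_1|$, and the resulting number of empty tiles over the six directions.

```python
import math


def lemma9_constants(M, eta=0.5, k_coeff=0.3):
    """Coefficients of log n in the area bounds, for N = 10 * ceil(27 * M * pi)."""
    N = 10 * math.ceil(27 * M * math.pi)
    # lower bound on |J'_1| / log n: k/(1+eta) - 9*M*pi/(2N), with k = k_coeff * log n
    per_side = k_coeff / (1 + eta) - 9 * M * math.pi / (2 * N)
    # six disjoint regions, each tile having area log n / N^2
    empty_tiles = 6 * per_side * N ** 2
    return N, per_side, empty_tiles


N, per_side, empty_tiles = lemma9_constants(30)
print(N, per_side, 11 / 60, empty_tiles)
```

With $M = 30$ the per-direction coefficient exceeds $11/60$ only by a small margin, which shows how the constants 27 and 10 in the choice of $N$ are being used.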

3 The sharp connectivity threshold for Sn,k 

In Lemma 9 of the previous section we proved that small components
witnessing $A_k$ have high point density. We use this fact to prove a
sharpness result for $\mathbb{P}(A_k)$, which by Corollary 8 implies in
turn a sharp threshold for the connectivity of $S_{n,k}$ (i.e.,
Theorem 1). We shall do this by showing that, for all $k' > k$, most
pointsets in $A_{k'}$ may be obtained by adding points to already dense
parts of $A_k$ pointsets.

We shall need the following lemma, which is a convenient restatement of 
Theorem 5 of pQ. 

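Lemma 11. There exists a positive constant $c_3 > 0$ such that for every
$\varepsilon$ with $0 < \varepsilon < \frac{1}{2}$ and all
$n > \varepsilon^{-c_3}$,

if $k \le 0.3\log n$, then $\mathbb{P}(S_{n,k} \text{ connected}) < \varepsilon$,
and if $k \ge 0.6\log n$, then $\mathbb{P}(S_{n,k} \text{ connected}) > 1 - \varepsilon$.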

Recall that in the previous section we fixed constants
$0 < \eta < \frac{1}{2}$ and $N \in \mathbb{N}$ and introduced a tiling of
$U_n$ into $M^2N^2$ small square tiles as well as the families of events
$A_{k,Q}$ and $A'_{k,Q}$. Lemma 9 says that provided
$\mathbb{P}(A_k) = \Omega(n^{-1.05})$, we have
$\mathbb{P}(\bigcup_Q A_{k,Q}) = (1 - O(n^{-0.05}))\,\mathbb{P}(A_k)$.
Thus if a small $A_k$ connected component occurs, then with high
probability some tile $Q$ receives far more points than expected. We show
that if $k' > k$ then most $A_{k'}$ pointsets can be obtained by adding
points to an overpopulated tile of an $A_k$ pointset.

We need one more piece of notation: given a tile $Q$ let $A_{k,Q,L}$ be
the event that if we remove any $L$ points from $Q$ then $A'_{k,Q}$ occurs.
Lemma 12. For any tile $Q$ and positive integer
$L \le \frac{\eta\log n}{2N^2}$ we have

$$A_{k+L,Q} \subseteq A_{k,Q,L}.$$

Proof. Suppose that $\mathcal{P} \subseteq U_n$ is a pointset for which
the event $A_{k+L,Q}$ occurs. It is enough to show that the removal of any
$L$ points from $\mathcal{P} \cap Q$ yields a pointset $\mathcal{P}'$ for
which the event $A'_{k,Q}$ occurs.

As in Lemma 9, write $U_{n,k}(\mathcal{P})$ for the $k$ nearest neighbours
graph on $U_n$ associated with the pointset $\mathcal{P}$. Since we remove
at most $L$ vertices from $\mathcal{P}$, every vertex in $\mathcal{P}'$
loses at most $L$ of its $k+L$ nearest neighbours; the set of its $k$
nearest neighbours in $\mathcal{P}'$ is thus a subset of the set of its
$k+L$ nearest neighbours in $\mathcal{P}$. It follows that
$U_{n,k}(\mathcal{P}')$ is a subgraph of $U_{n,k+L}(\mathcal{P})$.

$U_{n,k+L}(\mathcal{P})$ has a connected component wholly contained inside
$\frac{1}{2}U_n$. This component must contain at least $k + L + 1 > L$
vertices and since we have removed only $L$ vertices from $\mathcal{P}$ to
obtain $\mathcal{P}'$ some vertices of this component remain: that is,
$U_{n,k}(\mathcal{P}')$ must also have a component wholly contained inside
$\frac{1}{2}U_n$. Thus $\mathcal{P}' \in A_k$.

Moreover the number of points in $\mathcal{P}' \cap Q$ is exactly

$$|\mathcal{P} \cap Q| - L > (1+\eta)\frac{\log n}{N^2} - \frac{\eta\log n}{2N^2} = \Bigl(1 + \frac{\eta}{2}\Bigr)\frac{\log n}{N^2}$$

and hence $\mathcal{P}' \in A'_{k,Q}$ as claimed. □
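Corollary 13. Let $L \le \frac{\eta\log n}{2N^2}$ be a positive integer
and $Q$ a tile. Then

$$\mathbb{P}(A_{k+L,Q}) \le \Bigl(1 + \frac{\eta}{2}\Bigr)^{-L}\mathbb{P}(A'_{k,Q}).$$

Proof. First, note that we may consider the Poisson process on $U_n$ as
the union of a Poisson process on $Q$ and an independent Poisson process
on the disjoint set $U_n \setminus Q$. Now a Poisson point process on $Q$
is just a uniform point process placing

$$Z \sim \mathrm{Poisson}\Bigl(\frac{\log n}{N^2}\Bigr)$$

points in $Q$.

We may think of this uniform point process as adding points one by one.
If $A_{k,Q,L}$ occurs then in particular $A'_{k,Q}$ occurs if we remove
the last $L$ points added by the point process. It follows that

$$\begin{aligned}
\mathbb{P}(A_{k+L,Q}) &\le \mathbb{P}(A_{k,Q,L}) && \text{by Lemma 12} \\
&= \sum_m \mathbb{P}(A_{k,Q,L} \mid Z = m+L)\,\mathbb{P}(Z = m+L) \\
&\le \sum_m \mathbb{P}(A'_{k,Q} \mid Z = m)\,\mathbb{P}(Z = m+L) && \text{by definition of } A_{k,Q,L} \\
&= \sum_m \mathbb{P}(A'_{k,Q} \mid Z = m)\,\mathbb{P}(Z = m)\prod_{i=1}^{L}\frac{\log n/N^2}{m+i}.
\end{aligned}$$

By the definition of $A'_{k,Q}$,

$$\mathbb{P}(A'_{k,Q} \mid Z = m) = 0 \quad \text{for all } m \le \Bigl(1 + \frac{\eta}{2}\Bigr)\frac{\log n}{N^2}.$$

For $m > (1 + \frac{\eta}{2})\frac{\log n}{N^2}$, we have

$$\prod_{i=1}^{L}\frac{\log n/N^2}{m+i} \le \Bigl(\frac{\log n/N^2}{m}\Bigr)^{L} \le \Bigl(1 + \frac{\eta}{2}\Bigr)^{-L}.$$

It follows that

$$\mathbb{P}(A_{k+L,Q}) \le \Bigl(1 + \frac{\eta}{2}\Bigr)^{-L}\mathbb{P}(A'_{k,Q}),$$

as claimed. □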
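The key computation in this proof is the ratio of two Poisson point probabilities. The sketch below, with illustrative names and parameter values of our own choosing, checks numerically that $\mathbb{P}(Z = m+L) = \mathbb{P}(Z = m)\prod_{i=1}^{L}\lambda/(m+i)$ for $Z \sim \mathrm{Poisson}(\lambda)$, and that the product is at most $(1+\eta/2)^{-L}$ once $m > (1+\eta/2)\lambda$.

```python
import math


def poisson_pmf(lam, m):
    """P(Z = m) for Z ~ Poisson(lam), computed in log space."""
    return math.exp(m * math.log(lam) - lam - math.lgamma(m + 1))


def shift_ratio(lam, m, L):
    """Product over i = 1..L of lam / (m + i), i.e. P(Z = m + L) / P(Z = m)."""
    ratio = 1.0
    for i in range(1, L + 1):
        ratio *= lam / (m + i)
    return ratio


lam, eta, L = 40.0, 0.5, 5     # lam stands in for log n / N^2
m = 51                         # any m > (1 + eta / 2) * lam = 50
print(shift_ratio(lam, m, L), (1 + eta / 2) ** (-L))
```

In words: adding $L$ extra points to an already overpopulated tile costs a factor that decays geometrically in $L$, which is what drives the sharpness result.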



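Theorem 14. There are constants $C_4$ and $L \in \mathbb{N}$ such that for
all $n > C_4$ and all $k$ with

$$k \in [0.3\log n, 0.6\log n] \quad \text{and} \quad \mathbb{P}(A_k) > n^{-1.05}$$

we have:

$$\mathbb{P}(A_{k+L}) < e^{-1}\,\mathbb{P}(A_k).$$

Proof. Let $L$ be an integer constant which we shall specify later on. As
$\eta$, $L$ and $N$ are all constants, for an appropriate choice of our
constant $C_4 > 0$ and all $n > C_4$, we have
$L \le \frac{\eta\log n}{2N^2}$ so that the hypothesis of Corollary 13 is
satisfied. Also $k \in [0.3\log n, 0.6\log n]$, so the hypothesis of
Lemma 9 is satisfied as well. Applying the two lemmas successively, we get:

$$\begin{aligned}
\mathbb{P}(A_{k+L}) &= \mathbb{P}\Bigl(\bigcup_Q A_{k+L,Q}\Bigr) + O(n^{-1.1}) && \text{by Lemma 9} \\
&\le \sum_Q \mathbb{P}(A_{k+L,Q}) + O(n^{-1.1}) \\
&\le \sum_Q \Bigl(1 + \frac{\eta}{2}\Bigr)^{-L}\mathbb{P}(A'_{k,Q}) + O(n^{-1.1}) && \text{by Corollary 13} \\
&\le M^2N^2\Bigl(1 + \frac{\eta}{2}\Bigr)^{-L}\mathbb{P}(A_k) + O(n^{-1.1}) && \text{(since } \mathbb{P}(A'_{k,Q}) \le \mathbb{P}(A_k)\text{).}
\end{aligned}$$

We now choose $L$: let

$$L = \left\lceil \frac{\log(M^2N^2e^2)}{\log(1 + \frac{\eta}{2})} \right\rceil$$

so that

$$M^2N^2\Bigl(1 + \frac{\eta}{2}\Bigr)^{-L} \le e^{-2}.$$

By assumption $\mathbb{P}(A_k) > n^{-1.05}$, so for an appropriate choice
of our constant $C_4 > 0$ and all $n > C_4$, we have

$$\mathbb{P}(A_{k+L}) \le e^{-2}\,\mathbb{P}(A_k) + O(n^{-1.1}) < e^{-1}\,\mathbb{P}(A_k),$$

as claimed. (Note that the choice of our constant $L$ depended only on the
constants $M$, $N$ and $\eta$.) □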

Proof of Theorem 1. In essence, we just iterate Theorem 14. However, we
have to choose the right parameters and make sure the conditions hold at
each stage.

We choose $\gamma > 0$ such that
$\gamma > \max(c_2, c_3, \log_2 C_4, 20)$. Note that, since $M \ge 30$, we
have $0 < \frac{e^4}{M^2\log n} < 0.09$ for all $n \ge 2$, so
$n^{-0.05} > \frac{e^4 n^{-0.05}}{M^2\log n}$ for $n \ge 2$.

Suppose that $n$ and $k$ are such that
$\mathbb{P}(S_{n,k} \text{ is connected}) > \varepsilon$ and
$n > \varepsilon^{-\gamma}$. We may assume that
$\varepsilon < \frac{1}{2}$ and
$\mathbb{P}(S_{n,k} \text{ connected}) < 1 - \varepsilon$, for otherwise
we have nothing to prove. Since
$n > \varepsilon^{-\gamma} > \varepsilon^{-c_3}$ and
$\varepsilon < \mathbb{P}(S_{n,k} \text{ connected}) < 1 - \varepsilon$,
Lemma 11 implies that $0.3\log n < k < 0.6\log n$. Thus, for
$n > \varepsilon^{-\gamma}$, the assumptions of Corollary 8 and Theorem 14
are satisfied.

Let $C$ be a strictly positive real constant which we shall specify later
on. There are three cases to consider.

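Suppose first of all that

$$k + \lfloor C\log\tfrac{1}{\varepsilon}\rfloor \ge 0.6\log n.$$

Then by Lemma 11 we have
$\mathbb{P}(S_{n,k+\lfloor C\log(1/\varepsilon)\rfloor} \text{ connected}) > 1 - \varepsilon$,
and we are done.

Secondly suppose that
$k + \lfloor C\log\frac{1}{\varepsilon}\rfloor < 0.6\log n$ and

$$\mathbb{P}(A_{k+\lfloor C\log(1/\varepsilon)\rfloor}) < n^{-1.05}.$$

Since $n > \varepsilon^{-\gamma}$ and $\gamma > 20$, we have
$\varepsilon > n^{-0.05}$ and hence

$$n^{-1.05} < \frac{\varepsilon M^2\log n}{e^4 n},$$

so that by Corollary 8 we have
$\mathbb{P}(S_{n,k+\lfloor C\log(1/\varepsilon)\rfloor} \text{ connected}) > 1 - \varepsilon$,
and we are done.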
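Finally if

$$k + \lfloor C\log\tfrac{1}{\varepsilon}\rfloor < 0.6\log n \quad \text{and} \quad \mathbb{P}(A_{k+\lfloor C\log(1/\varepsilon)\rfloor}) \ge n^{-1.05}$$

then since $\mathbb{P}(A_{k'})$ monotonically decreases as $k'$ increases
we have $\mathbb{P}(A_{k'}) \ge n^{-1.05}$ for every $k'$ with
$k \le k' \le k + \lfloor C\log\frac{1}{\varepsilon}\rfloor$. Thus by
Theorem 14 we have, for all $k'$ with
$k \le k' \le k + \lfloor C\log\frac{1}{\varepsilon}\rfloor$,

$$\mathbb{P}(A_{k'+L}) \le e^{-1}\,\mathbb{P}(A_{k'}).$$

Since $k < 0.6\log n$,
$\mathbb{P}(S_{n,k} \text{ connected}) > \varepsilon$ implies by
Corollary 8 that
$\mathbb{P}(A_k) < \frac{eM^2\log n}{n}\log\frac{1}{\varepsilon}$. Thus

$$\begin{aligned}
\mathbb{P}(A_{k+\lfloor C\log(1/\varepsilon)\rfloor}) &\le \exp\left(-\left\lfloor\frac{\lfloor C\log 1/\varepsilon\rfloor}{L}\right\rfloor\right)\mathbb{P}(A_k) \\
&\le \exp\left(-\left\lfloor\frac{C\log 1/\varepsilon}{L}\right\rfloor\right)\frac{eM^2\log n}{n}\log\frac{1}{\varepsilon} \\
&\le \exp\left(-\left\lfloor\frac{C\log 1/\varepsilon}{L}\right\rfloor + 1 + \log\log\frac{1}{\varepsilon}\right)\frac{M^2\log n}{n}.
\end{aligned}$$

We now choose $C$: let

$$C = \left(2 + \frac{6}{\log 2}\right)L.$$

Since $\varepsilon < \frac{1}{2}$, we have that
$\frac{\log 1/\varepsilon}{\log 2} > 1$. Thus for this choice of $C$ we
have

$$\begin{aligned}
-\left\lfloor\frac{C\log 1/\varepsilon}{L}\right\rfloor + 1 + \log\log\frac{1}{\varepsilon} &\le 2 + \log\log\frac{1}{\varepsilon} - \frac{C\log 1/\varepsilon}{L} \\
&= \left(2 - 2\,\frac{\log 1/\varepsilon}{\log 2}\right) + \left(\log\log\frac{1}{\varepsilon} - \log\frac{1}{\varepsilon}\right) - \frac{4\log 1/\varepsilon}{\log 2} - \log\frac{1}{\varepsilon} \\
&< -4 - \log\frac{1}{\varepsilon}.
\end{aligned}$$

Substituting this in the above bound for
$\mathbb{P}(A_{k+\lfloor C\log 1/\varepsilon\rfloor})$ we get

$$\mathbb{P}(A_{k+\lfloor C\log 1/\varepsilon\rfloor}) < \frac{\varepsilon M^2\log n}{e^4 n}.$$

By Corollary 8 this implies

$$\mathbb{P}(S_{n,k+\lfloor C\log 1/\varepsilon\rfloor} \text{ connected}) > 1 - \varepsilon,$$

proving the theorem.

□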
4 Higher connectivity

In this section, we shall apply our sharpness result Theorem 1 to prove
Theorem 2, proving a conjecture of Balister, Bollobas, Sarkar and
Walters [3]. Suppose that $\mathcal{P}$ is any pointset in the square
$S_n = [0,\sqrt{n}]^2$. As before, let $G_k(\mathcal{P})$ denote the $k$
nearest neighbour graph on $\mathcal{P}$.

Lemma 15. Suppose $S_{n,k}$ is the random geometric graph with $k$ an
integer lying between $0.3\log n$ and $0.6\log n$. Let
$s \le 0.1\log n$. Then there is a constant $c_6$ such that

$$\mathbb{P}(S_{n,k} \text{ not } s\text{-connected}) \le c_6(\log n)\,\mathbb{P}(S_{n,k-1} \text{ not } (s-1)\text{-connected}) + O(n^{-3}).$$

Moreover

$$\mathbb{P}(S_{n,k} \text{ not } s\text{-connected}) \le (c_6\log n)^{s-1}\,\mathbb{P}(S_{n,k-s+1} \text{ not connected}) + O(n^{-3}\log\cdots)$$

We shall need the following technical result to prove Lemma 15.

Lemma 16. Suppose $0.3\log n \le k \le 0.6\log n$. Then there exists
$c_7$ such that the collection of pointsets $\mathcal{P}$ from which we
may delete a set $T$ of at most $0.1\log n$ points so that either of the
following hold:

• there is any point $x \in S_n$ (not necessarily in $\mathcal{P}$) with
$\lfloor 0.6\log n\rfloor$-nearest neighbour radius in
$\mathcal{P} \setminus T$ at least $c_7\sqrt{\log n}$

• $G_k(\mathcal{P}) \setminus T$ contains at least two components of
diameter at least $c_7\sqrt{\log n}$

has probability $O(n^{-3})$.

Proof. This is an easy modification of Lemmas 2 and 6 of [1]. □

Proof of Lemma 15. We can view the Poisson process as follows. Suppose that $X_1, X_2, X_3, \ldots$ is an infinite sequence of uniformly distributed random variables in $S_n$ and let $Z \sim \mathrm{Po}(n)$. Then let the points in $V$ be given by $(X_i)_{i=1}^{Z}$. Let $\mathcal{V}_m$ denote the collection of pointsets with exactly $m$ points, which we equip with the conditional measure, sometimes denoted $\mathbb{P}_m$. From this point of view it is easy to see that we have $m$ measure preserving maps $\phi_i$, for $1 \le i \le m$, from $\mathcal{V}_m$ to $\mathcal{V}_{m-1}$, given by deleting the point $X_i$. We shall usually abbreviate $\phi_i$ to $\phi$.
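The construction above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the points are drawn independently (the standard construction of a Poisson process), and $Z$ is sampled by counting unit-rate exponential arrivals in $[0, n]$, which is one of several valid ways to draw a Poisson variable.

```python
import math
import random

def sample_poisson(rng, mean):
    """Sample Z ~ Po(mean) by counting unit-rate exponential arrivals in [0, mean]."""
    count = 0
    t = rng.expovariate(1.0)
    while t <= mean:
        count += 1
        t += rng.expovariate(1.0)
    return count

def sample_pointset(rng, n):
    """Return (X_1, ..., X_Z): Z ~ Po(n) independent uniform points in [0, sqrt(n)]^2."""
    side = math.sqrt(n)
    z = sample_poisson(rng, n)
    return [(rng.uniform(0, side), rng.uniform(0, side)) for _ in range(z)]

def delete_point(points, i):
    """Hypothetical stand-in for phi_i: drop the i-th point (1-indexed), m points -> m - 1."""
    return points[: i - 1] + points[i:]
```

For a pointset with $m$ points, `delete_point(points, i)` for $i = 1, \ldots, m$ plays the role of the $m$ deletion maps.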

Let $\mathcal{A}_s$ denote the collection of pointsets $V$ for which $G_k(V)$ is not $s$-connected but $G_{k-1}(V)$ is $(s-1)$-connected. Let $\mathcal{B}_s$ denote those pointsets $V$ for which $G_{k-1}(V)$ is not $(s-1)$-connected. Finally let $\mathcal{C}$ denote the collection of pointsets $V$ for which either of the conditions in Lemma 16 holds, which we shall think of as the 'bad' pointsets. By Lemma 16, $\mathbb{P}(\mathcal{C}) = O(n^{-3})$.

For any pointset $V$ in $\mathcal{A}_s$ it is clear that (at least) one of the functions $\phi_i$ maps $V$ into $\mathcal{B}_s$. Indeed, since $G_k(V)$ is not $s$-connected, there is a point $X_i$ which we can delete to make the graph not $(s-1)$-connected. Since $G_{k-1}(V \setminus X_i)$ is a subgraph of $G_k(V) \setminus X_i$, the map $\phi_i$ is one such function.

Thus $\mathcal{A}_s \subseteq \bigcup_{i=1}^{m} \phi_i^{-1}(\mathcal{B}_s)$.
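The subgraph relation used in this step can be checked numerically on small examples. The following Python sketch is a brute-force illustration, not part of the paper: the helper names `knn_graph` and `deletion_is_subgraph` are hypothetical, pointsets are represented as a dictionary from labels to coordinates, and distance ties (which have probability zero for random points) are broken by label order.

```python
import math
import random

def knn_graph(points, k):
    """Edge set of the undirected k-nearest-neighbour graph on a labelled pointset."""
    edges = set()
    for u in points:
        # Sort the other labels by distance to u; sorted() is stable on ties.
        others = sorted((v for v in points if v != u),
                        key=lambda v: math.dist(points[u], points[v]))
        for v in others[:k]:
            edges.add(frozenset((u, v)))
    return edges

def deletion_is_subgraph(points, k, x):
    """Check that G_{k-1}(V minus x) is a subgraph of G_k(V) with x removed."""
    reduced = {label: p for label, p in points.items() if label != x}
    small = knn_graph(reduced, k - 1)
    big_without_x = {e for e in knn_graph(points, k) if x not in e}
    return small <= big_without_x
```

The check succeeds because deleting one point removes at most one of the $k$ nearest neighbours of any other point, so the $k-1$ nearest neighbours in $V \setminus X_i$ are always among the $k$ nearest neighbours in $V$.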

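Note that $\mathbb{P}(|Z - n| > n/2) = o(e^{-n/2})$. We have

$$\begin{aligned}
\mathbb{P}(\mathcal{A}_s) &= \sum_{m=0}^{\infty} \mathbb{P}(\mathcal{A}_s \mid Z = m)\,\mathbb{P}(Z = m) \\
&= \sum_{m=n/2}^{3n/2} \mathbb{P}(\mathcal{A}_s \mid Z = m)\,\mathbb{P}(Z = m) + o(e^{-n/2}) \\
&= \sum_{m=n/2}^{3n/2} \mathbb{P}_m(\mathcal{A}_s \setminus \mathcal{C})\,\mathbb{P}(Z = m) + O(n^{-3}) \\
&= \sum_{m=n/2}^{3n/2} \mathbb{P}_m\bigl(V \in \mathcal{A}_s \setminus \mathcal{C} \text{ and } \exists i : \phi_i(V) \in \mathcal{B}_s\bigr)\,\mathbb{P}(Z = m) + O(n^{-3}) \\
&\le \sum_{m=n/2}^{3n/2} \sum_{i=1}^{m} \mathbb{P}_m\bigl(V \in \mathcal{A}_s \setminus \mathcal{C} \text{ and } \phi_i(V) \in \mathcal{B}_s\bigr)\,\mathbb{P}(Z = m) + O(n^{-3}) \\
&= \sum_{m=n/2}^{3n/2} m\,\mathbb{P}_m\bigl(V \in \mathcal{A}_s \setminus \mathcal{C} \text{ and } \phi(V) \in \mathcal{B}_s\bigr)\,\mathbb{P}(Z = m) + O(n^{-3}).
\end{aligned}$$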

Now consider $\mathbb{P}_m(V \in \mathcal{A}_s \setminus \mathcal{C} \text{ and } \phi(V) \in \mathcal{B}_s)$. For each $V \in \mathcal{A}_s \setminus \mathcal{C}$ with $\phi(V) \in \mathcal{B}_s$ we see that $G_{k-1}(V)$ is $(s-1)$-connected but $G_{k-1}(\phi(V))$ is not $(s-1)$-connected. Fix a separating set $T$ of $s-1$ vertices for $G_{k-1}(\phi(V))$. Since $V \notin \mathcal{C}$ we have that all but one of the components in the separated graph $G_{k-1}(\phi(V)) \setminus T$ are small: less than $c_7\sqrt{\log n}$ in diameter. Fix one such component $C$. Since $G_{k-1}(V)$ is $(s-1)$-connected we see that $G_{k-1}(V) \setminus T$ is connected, so $X_i$ must be joined to $C$ in $G_{k-1}(V)$ and, hence, $X_i$ lies within distance $c_7\sqrt{\log n}$ of $C$. Therefore $X_i$ lies within a set of measure less than $4\pi c_7^2 \log n$ which is determined by $V \setminus X_i$. This event has probability less than $\frac{4\pi c_7^2 \log n}{n}$. Thus, as $\phi$ is a measure preserving transformation from $\mathcal{V}_m$ to $\mathcal{V}_{m-1}$,

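$$\begin{aligned}
\mathbb{P}_m\bigl(V \in \mathcal{A}_s \setminus \mathcal{C} \text{ and } \phi(V) \in \mathcal{B}_s\bigr) &\le \frac{4\pi c_7^2 \log n}{n}\,\mathbb{P}_m\bigl(\phi(V) \in \mathcal{B}_s\bigr) \\
&= \frac{4\pi c_7^2 \log n}{n}\,\mathbb{P}_{m-1}\bigl(V \in \mathcal{B}_s\bigr).
\end{aligned}$$

To complete the proof note that $\mathbb{P}(Z = m) \le 2\,\mathbb{P}(Z = m-1)$ for all $m \ge n/2$. Thus

$$\begin{aligned}
\mathbb{P}(\mathcal{A}_s) &\le \sum_{m=n/2}^{3n/2} m\,\mathbb{P}_m\bigl(V \in \mathcal{A}_s \setminus \mathcal{C} \text{ and } \phi(V) \in \mathcal{B}_s\bigr)\,\mathbb{P}(Z = m) + O(n^{-3}) \\
&\le \sum_{m=n/2}^{3n/2} m\,\frac{4\pi c_7^2 \log n}{n}\,\mathbb{P}_{m-1}\bigl(V \in \mathcal{B}_s\bigr)\,\mathbb{P}(Z = m) + O(n^{-3}) \\
&\le (12\pi c_7^2 \log n) \sum_{m=n/2}^{3n/2} \mathbb{P}_{m-1}\bigl(V \in \mathcal{B}_s\bigr)\,\mathbb{P}(Z = m-1) + O(n^{-2}) \\
&\le (12\pi c_7^2 \log n)\,\mathbb{P}(\mathcal{B}_s) + O(n^{-2}).
\end{aligned}$$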
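Finally observe that

$$\{V : S_{n,k} \text{ not } s\text{-connected}\} \subseteq \mathcal{A}_s \cup \mathcal{B}_s$$

so that the first part of the lemma holds with $c_6 = 12\pi c_7^2 + 1$:

$$\mathbb{P}(S_{n,k} \text{ not } s\text{-connected}) \le c_6 \log n\, \mathbb{P}(S_{n,k-1} \text{ not } (s-1)\text{-connected}) + O(n^{-2}).$$

Iterating this $s - 1 = O(\log n)$ times we obtain the second part of our claim. □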
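Two arithmetic facts used in the proof above are easy to confirm numerically: for $Z \sim \mathrm{Po}(n)$ the ratio $\mathbb{P}(Z = m)/\mathbb{P}(Z = m-1)$ equals $n/m$, which is at most $2$ when $m \ge n/2$, and the product $m \cdot \frac{4}{n} \cdot 2$ is at most $12$ when $m \le 3n/2$, which is where the constant $12\pi c_7^2$ comes from. The Python sketch below is illustrative only; the function names are hypothetical and the probability mass function is evaluated in log space to avoid overflow.

```python
import math

def log_poisson_pmf(m, mean):
    """log P(Z = m) for Z ~ Po(mean), using lgamma for log(m!)."""
    return -mean + m * math.log(mean) - math.lgamma(m + 1)

def max_consecutive_ratio(n):
    """Largest P(Z = m) / P(Z = m - 1) over integers m with n/2 <= m <= 3n/2."""
    lo = max(1, math.ceil(n / 2))
    hi = math.floor(3 * n / 2)
    return max(math.exp(log_poisson_pmf(m, n) - log_poisson_pmf(m - 1, n))
               for m in range(lo, hi + 1))

def prefactor(m, n):
    """Coefficient of pi * c7^2 * log(n) in the product m * (4 / n) * 2."""
    return 8 * m / n
```

The maximum ratio is attained at the smallest admissible $m$, and `prefactor` is largest at $m = 3n/2$.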

We can now finally turn to the proof of Theorem 2.

Proof of Theorem 2. By Theorem 2 of [3] we may restrict ourselves to the
case where s{n) is an integer sequence with s{n) < 2^ log log n ■ Suppose that 
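$k = k(n)$ is such that $S_{n,k}$ is connected whp, so that

$$\mathbb{P}(S_{n,k} \text{ is not connected}) \to 0.$$

By Theorem 1 with $\varepsilon = (c_6 \log n)^{-s}$,

$$\mathbb{P}\bigl(S_{n,k+\lfloor C\log(1/\varepsilon)\rfloor} \text{ is not connected}\bigr) < \varepsilon$$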

for all sufficiently large n. (Explicitly, this is for all n with n > e~"'. 
Given our choice of e and the restriction on s, is at most exp(| logn + 
^( log'iogn ))' ®° ^^^^ ^^i® i^ indeed satisfiable for large enough n.) Now 

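$$C \log\frac{1}{\varepsilon} + s - 1 < 2Cs \log\log n$$

for all sufficiently large $n$. If $k + \lfloor 2Cs\log\log n\rfloor \le 0.6\log n$, we have by Lemma 15

$$\begin{aligned}
\mathbb{P}\bigl(S_{n,k+\lfloor 2Cs\log\log n\rfloor} \text{ not } s\text{-connected}\bigr)
&\le (c_6\log n)^{s-1}\,\mathbb{P}\bigl(S_{n,k-s+1+\lfloor 2Cs\log\log n\rfloor} \text{ not connected}\bigr) + O(n^{-2}\log n) \\
&\le (c_6\log n)^{s-1}\,\mathbb{P}\bigl(S_{n,k+\lfloor C\log(1/\varepsilon)\rfloor} \text{ not connected}\bigr) + O(n^{-2}\log n) \\
&\le (c_6\log n)^{s-1}\,\varepsilon + O(n^{-2}\log n) \\
&= O(1/\log n) = o(1)
\end{aligned}$$

as required. If on the other hand $k + \lfloor 2Cs\log\log n\rfloor > 0.6\log n$, we have

$$\mathbb{P}\bigl(S_{n,k+\lfloor 2Cs\log\log n\rfloor} \text{ is not } s\text{-connected}\bigr) = o(1)$$

by Theorem 2 of [3]. The result follows. □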
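The arithmetic behind the choice $\varepsilon = (c_6 \log n)^{-s}$ can be sanity-checked numerically. The Python sketch below is illustrative only: the function names are hypothetical, and the values passed for $C$ and $c_6$ in any experiment are placeholders, since the paper does not give numerical values for these constants. It checks the shift inequality $C\log(1/\varepsilon) + s - 1 < 2Cs\log\log n$ and the identity $(c_6\log n)^{s-1}\varepsilon = 1/(c_6 \log n)$ that yields the $O(1/\log n)$ bound.

```python
import math

def epsilon(c6, s, log_n):
    """The choice eps = (c6 * log n)^(-s) made in the proof."""
    return (c6 * log_n) ** (-s)

def shift_condition(C, c6, s, log_n):
    """True when C * log(1/eps) + s - 1 < 2 * C * s * log(log n)."""
    eps = epsilon(c6, s, log_n)
    return C * math.log(1 / eps) + s - 1 < 2 * C * s * math.log(log_n)

def leading_term(c6, s, log_n):
    """(c6 * log n)^(s-1) * eps, which should equal 1 / (c6 * log n)."""
    return (c6 * log_n) ** (s - 1) * epsilon(c6, s, log_n)
```

With placeholder values such as $C = 1$, $c_6 = 2$, $s = 3$, the shift inequality fails when $\log n = e$ and holds when $\log n = e^3$, which illustrates why the proof asks for $n$ sufficiently large.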